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In quantum game theory, one of the most intriguing and important questions is, “Is it possible 
to get quantum advantages without any modification of the classical game?” The answer to this 
question so far has largely been negative. So far, it has usually been thought that a change of the 
classical game setting appears to be unavoidable for getting the quantum advantages. However, we 
give an affirmative answer here, focusing on the decision-making process (we call ‘reasoning’) to 
generate the best strategy, which may occur internally, e.g., in the player’s brain. To show this, we 
consider a classical guessing game. We then define a one-player reasoning problem in the context 
of the decision-making theory, where the machinery processes are designed to simulate classical and 
quantum reasoning. In such settings, we present a scenario where a rational player is able to make 
better use of his/her weak preferences due to quantum reasoning, without any altering or resetting 
of the classically defined game. We also argue in further analysis that the quantum reasoning may 
make the player fail, and even make the situation worse, due to any inappropriate preferences. 

PACS numbers: 03.67.-a, 02.50.Le 


I. INTRODUCTION 


Game theory, which is a very well established disci¬ 
pline in mathematics [l|, has successfully been applied 
to various fields, such as social science Q, evolutions 
in biology Q, and economics [5[. At the abstract level, 
game theory mainly deals with legitimate strategies and 
scores of the players. Thus, a game is defined by the 
strategies on one hand, and by a specification of how 
to evaluate the players’ game scores on the other hand. 
Recently, physicists have been attempting to generalize 
the game into a new scenario finding common theoret- 
ical properties between the game and quantum theory 
|6l-[lQj. Of particular interest to this generalization is to 
study whether it is possible to replace the classical strat¬ 
egy with a quantum strategy for getting quantum advan¬ 
tages, if any [HI- The quantum advantages from such a 
generalization have been found to be relevant to these 
games. For example, consider the “penny-flip game” [6|, 
where two players take turns choosing whether or not to 
flip a penny inside a box, and the starting player opens 
the box to identify if the penny is flipped from its start¬ 
ing position or not. Here, if one player can adopt a quan¬ 
tum penny, then he/she has a better chance of winning 
assisted by quantum superposition. Another celebrated 
example is the “Prisoner’s game” [3,[il, where two players 
face a dilemma, since acting rationally for their own in¬ 
terests would result in a collectively worse outcome. But 
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this dilemma can also be solved by adopting quantum 
strategies that the players can realize. Most recently, 
some new game scenarios have been conceived, that es¬ 
tablish a strong link to communication complexity [a 
and Bell-inequality engaging the nonlocality [l3|, . 

Following up on the successes of the previous studies 
of the quantum game, we also plan to explore a posi¬ 
tive role of the ‘quantum’ in a classically designed game. 
In particular, we consider the following question, “Is it 
possible to get quantum advantages without any quan¬ 
tum modification?” This question is important because 
nearly all games are allowed to have the advantages due 
to “quantum strategies”, but it has usually been thought 
that one inevitably needs to change the original form 
of the classical game to enjoy the quantum advantages 
Therefore, the answer to the aforementioned 
question has been negative. However, here we find an 
affirmative answer, focusing on — substantially differ¬ 
ent from the earlier approaches — the decision-making 
(we call “reasoning” hereafter) of the rational player. To 
show this, we design a classical two-player game, called 
the Secret-Bit Guessing Game, where one player named 
Bob attempts to guess the secret bits of the other player, 
Alice. For this game, we map out two parallel ways of 
Bob’s reasoning to choose his best answering strategies: 
one is classical probabilistic, and the other is quantum. 
Each reasoning that is drawn in Bob’s brain is modeled 
as a machinery process for systematic analysis and fair 
comparison. On the basis of the payoff-function analy¬ 
sis, we explicitly show that the quantum reasoning can be 
more advantageous without changing the classical setting 
of the game. This is because the rational player. Bob, can 
make better use of his weak preferences, faithfully deal¬ 
ing with quantum superposition. However, we also ar- 
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gue in further analysis that the quantum reasoning may 
frustrate Bob, and even make the situation worse due 
to malicious hints that can lead Bob to have the wrong 
preferences. 


II. PRELIMINARY 


A. A secert-bit guessing game 


Firstly, we consider a secret-bit guessing game [illii. 
In the game, Alice generates two bits and keeps them 
on her memory Mx {x = 0,1). Here we note that the 
identity of the bits are in Mx , regardless of whether any¬ 
one can access it or not (i.e., the secret bits are classi¬ 
cal). Then, the other player Bob chooses his answering 
strategies i^Bob ^ {0,1} to guess the secret bits enveloped 
by Alice, considering four possible strategies UA\ice{x) 
(x = 0,1) that Alice may have. Specifically, 


' [r.l] itAiice(a:;) = 0, 

[t. 2] UAiice(a:) = x, 

[t.3] UAiice(a:) = 1 © a;, 

, [t. 4] itAiice(a;) = 1, 


( 1 ) 


where ‘0’ denotes modulo-2 addition. Here, if Bob’s an¬ 
swer is correct (i.e., r^Bob = Alice) for a given x. Bob 
wins a single-point | and Alice loses the same single¬ 
point. Otherwise, in case Bob gives wrong answer (i.e., 
UBoh 7^ '^Aiice)? Alice and Bob get the single-points ^ 
and — respectively. This game is thus defined as 
Q = Q{S^P), where S and P denote the (non-empty) 
sets of the players’ strategies (r^Aiice, r^Bob) and game 
scores (^AUce, ^Bob), respectively. Noting that Bob makes 
two answers for Mx {x = 0,1), the possible game scores 
for Alice and Bob after one game are made by adding 
the two single-points, and thus ^AiiccCBob ^ 

Note further that ^Aiice is equal to —^Bob? or ^uivalently, 
Caucc + ^Bob = 0; i-o., our game is zero-sum [l[. 


B. One-player reasoning problem 
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FIG. 1: (Color online) For each set of Alice’s strategies 
UA\ice{x) {x = 0,1), we specify the regions of well-quantified 
probabilities of Bob’s preferences in the space of (oo, oi). 


X G {0,1}, depending on the coefficients (ro,ri) G R. 
Note that the function in Eq. ([2]) can generate all possi¬ 
ble sets [t.1]-[t.4] of UA\ice{x) in Eq. ([1]). Here we con¬ 
sider the concept of a hint given from, e.g., a helper jioj . 
which allows Bob to have (‘weak’ or possibly ‘strong’) 
preferences over R. Thus, we can formulate our prob¬ 
lem (R, P) with Bob’s preferences and alternatives R 
in the context of the theory of decision-making [l|. We 
note that the hints are presented in abstract form. We 
assume an “interpretation function” that quantifies his 
own preferences, such that [4l[ 

{Pr(ro -> k), Pr(ri ^ k')} G H {k, k' = 0,1), (3) 

where Fi{rj k) denotes the probability of choosing 
“rj ^ /c” (j, /c = 0,1), and H denotes the possible set of 
those probabilities. Here, Pr(rj ^ /c 0 1) = 1 — Pr(rj ^ 
k). Thus if Pr(rj k) > Bob wants to choose ^ 
/c”, at least as much as “rj ^ /c 0 1” (i.e., “r^ ^ /c” ^ 
“rj ^ A:0l”), and vice versa. More specifically, we write 
Pr(rj ^ k) as 


Whereas in previous studies the strategies have usu¬ 
ally been generalized in a quantum regime, our primary 
concern here is with the reasoning process. In particular, 
we would like to investigate if a quantum reasoning can 
yield a higher winning average compared to the classical 
ones even in a fully classical game. We now turn our 
attention to Bob’s reasoning to make his valid answering 
strategies i^Bob for x = 0,1. We define this a “One-Player 
Reasoning Problem.” 

To deal with this problem, we design a process of Bob’s 
reasoning by introducing a one-bit Boolean function, 

'?^Bob(^) = ro 0rix, (2) 

where r^^ri G {0,1}. Then, Bob’s reasoning is nothing 
but the process of making the output r^Bob(^) for a given 


= i + (-l7a,-, (4) 

where aj G [— ^5 is defined as a factor to represent the 
bias of Bob’s preferences. 

Here, if the direction of a = (ao,fri)^ is appropri¬ 
ately assigned, we say that the probabilities in Eq. (j3j) 
are well-quantified^ where a was defined as a vector on 
the two-dimensional space of (o^i, 0 ^ 2 ). Eor example, if 
Alice’s strategies UA\[ce{x) are [r.l], the well-quantified 
probabilities are characterized with the directional con¬ 
dition {ai > 0 and 0^2 > 0) (see Eig. [T]for all the cases). 
Bob is supposed to perform his reasoning, believing his 
ability to interpret the given hints. A schematic picture 
of our game is presented in Eig. [2l 
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FIG. 2: (Color online) Schematic picture of our game setting. 
Alice sets the two secret bits into her memory Mx (x = 0,1) 
and Bob attempts to guess them. In this game, we define 
one-player (Bob’s) reasoning problem with a certain set of 
probabilities of the player’s own preferences, as in Eq. 
Here, we replace the reasoning to the process of a machinery 
that consists of the corresponding internal devices involved 
(e.g., one inside the player’s brain). See main text in Sec. IIIIl 


III. TWO PARALLEL REASONING 
PROCESSES: CLASSICAL PROBABILISTIC, 
AND QUANTUM 

In order to perform a more systematic analysis, we re¬ 
place the reasoning to a machinery process, which may 
occur, e.g., in Bob’s brain. To this end, we consider a 
fledged computing module, as depicted in Fig. [2l to sim¬ 
ulate the reasoning process of Eq. m Hi- This comput¬ 
ing module consists of two one-way channels Cx and Cy, 
where Cx transmits the classical signals of the memory 
number x, and Cy deals with the signals of Bob’s strate¬ 
gies UBoh{x) {x = 0,1). Two probabilistic logic gates Rq 
and Ri are also placed in Cy^ but note that Ri acts con¬ 
ditioned on the input x in Cx being 1. Bob’s strategies 
'^Bob(^) are identified by the measurement at the end of 
Cy. We here introduce another internal machine, to be 
called an interpretation machine, to generate the proba¬ 
bilities Pr(rj ^ k) (j, /c = 0,1) of Bob’s preferences. 

With this general model of reasoning machinery, we as¬ 
sume that Bob can make two different types of reasoning: 
Classical probabilistic and quantum. Here, we note that 
the signals x in Cx should be classical even in the case of 
quantum reasoning, as it is regarded as an element of the 
game. Thus, we do not need to consider any additional 
internal process to convert the classical information to 
the quantum information in the reasoning, or vice versa 
[T^. This assumption is not trivial, as we have to make 
a fair comparison of the two reasonings, independently 
from the classically designed game . 

1) Classical probabilistic reasoning. - Firstly, let us 
assume that the signals of Cy are classical and the logic 
gates Rj {j = 0,1) act according to the probabilistic 
rule, to be either “U” with the probability Pr(rj ^ 0) or 
“NOT” with the probability Pr(rj ^1). In this case, Rq 
and Ri are classical probabilistic gates. Such a probabilis¬ 
tic instruction results in better computational outputs in 
a heuristic manner [2^, and could allow a reasonable 


Signal x=0,1 



FIG. 3: (Golor online) A realizable and concrete setting of 
Bob’s classical probabilistic reasoning. In such a setting, 
the analyzer (or the controller) receives the quantified prob¬ 
abilities from the interpretation machine, and performs the 
Monte-Garlo method by generating a (classical) random num¬ 
ber r G [0,1). Here, if the randomly generated r is smaller (or 
larger) than Pr(rj ^ 0), the switching device of Rj connects 
the incoming signal to ‘Identity’ (or ‘NOT’). Bob identifies a 
value of r^Bob(2:) for given x in the classical measurement. 
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FIG. 4: (Golor online) A setting of Bob’s quantum reasoning. 
Here we consider a linear optical implementation, where the 
signals of Cy are encoded as polarized single-photon states \H) 
and \V) (i7 = 0 and U = 1). The unitary gates Rj (j = 0,1) 
are realized by a set of wave plates (QWP-HWP-QWP) for 
the polarized photon [^. The analyzer maps the quantified 
probabilities in Eq. 0 to the control parameters of the wave- 
plates, using Eq. (|5]) and Eq. (|6]). Then, Bob’s brain performs 
the quantum measurement on the hnal output photon. 


comparison with the unitary gates adopted in the quan¬ 
tum reasoning (as described later). To make the strategy 
of the answer UBoh{x) for a given x, the final value 0 or 
1 passing through the gates Rj {j = 0,1) is identified by 
the classical measurement. In Fig. [3l we sketch a real¬ 
izable and concrete implementation of such a machinery 
process for the classical probabilistic reasoning. 

2) Quantum reasoning. - On the other hand. Bob can 
also follow the quantum reasoning, where the crucial part 
of the computing module, including the channel Cy, logic 
gates Rj {j = 0,1), and measurement device, are quan¬ 
tum. In such a case, each of the gates Rj {j = 0,1) is to 
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be a unitary transformation, defined as 


/ V^Pr (r^- ^ 0) v/Pr(rj ^ 1 )\ 

^FT{rj 1) — A/Pr(rj ^ 0) J ’ 


( 5 ) 


which also leaves and flips the states | 0 ) and | 1 ) with 
the probability Fi{rj 0 ) and Pr(rj ^ 1 ), respectively. 
However, it should be noted that the unitary gate Rj has 
an additional degree of freedom, i.e., quantum phase 
to exhibit the genuine property of the quantum superpo¬ 
sition [43|. It allows the rational player. Bob, to explore 
an additional rule for setting the phases (pj {j = 0 , 1 ) 
to maximize his winning averages. In our game. Bob 
additionally uses the directional condition of a, which 
could not considered in the classical probabilistic reason¬ 
ing. More specifically, Bob’s brain sets the phases cpj 
(j = 0 , 1 ), according to 


Here, we briefly comment that the two settings de¬ 
scribed above are parallel in the sense that the opera¬ 
tion of the logic gates are comparable in each reasoning. 
Note that, for the number of games, the operations of 
the classical probabilistic gates Rj (j = 0 , 1 ) can also be 
represented by a stochastic evolution matrix as 


^ fPv{rj 0) Pr(rj ^ 1)\ 
j YPr(rj ^ 1) Pr(rj ^ 0)y ’ 


( 7 ) 


where the matrix elements may provide the candidate 
transition probabilities which describe the dynamics of 
unitary transformation of Eq. (j5j) [i^ . 


IV. ANALYSIS OF BOB’S AVERAGE SCORES 
ACHIEVABLE FROM THE TWO REASONINGS 


{ (i) A = 0 if a^ai > 0 , 

{ii) A = I if a^ai = 0, ( 6 ) 

{in) A = 1 if aoai < 0, 

where A = . These rules were made to maximize 

Bob’s winning averages. Here, the case (i) describes the 
situation that Bob’s brain, internally, chooses A = 0 
when a contains the directional conditions (ao > 0 , 
ai > 0 ) or (q^o < 0 , q^i < 0 ), which are toward [r.l] 
and [t.3], respectively. In the case of {Hi), A is set to 

be 1 with (q^o > 0 , q^i < 0 ) or {a^ < 0 , > 0 ) whose 

directions are toward [t.2] and [t.4], respectively. How¬ 
ever, in the case of {ii), i.e., when q/q = 0 or o^i = 0 , 
it is not possible to find any useful setting, as a feasible 
direction of a cannot be sured. At the final step, Bob’s 
brain performs the quantum measurement on the final 
state to get UBoh{x). Note that, in the view of the intrin¬ 
sic probabilistic nature of the quantum system, the final 
state does not result in a definite or predictable outcome 
value. In Fig. IH a schematic example of such a quan¬ 
tum reasoning procedure is sketched in the linear-optical 
regime. 


A crucial task in game theory is to investigate a func¬ 
tion /$, which determines the average scores of the play¬ 
ers over the number of games. In our game, such a func¬ 
tion /$, named the payoff function, can be defined by 

: S X H X A ^ (^Alice ^ 1^5 ^Bob ^ ^) 5 (8) 

where SAiice and Seob denote the total average scores 
of Alice and Bob, respectively, and A denotes the set of 
possible reasonings. As mentioned before, our game is a 
two-player zero-sum game, so it is sufficient to analyze 
the score of one of the players. We thus focus on the 
average score Seob of Bob throughout the work. 

More specifically, the total average score Seob can be 
evaluated as 



4 

^ ^ ^Bob,r 5 
r=l 


( 9 ) 


where it is assumed that Alice chooses her bits at random. 
Here, ^Bob r ~ 15 2,3,4) is also defined as the score 
averaged for a specific set of Alice (^) {j = ^A)^ 


J 


‘^Bobj'i 


= E 

35 = 0,1 


Pr ('UBob(^) — '^Alice(^)) r)Br ('UBob(^) A ^Alice(^)) 


( 10 ) 


where Pr(uBob(a:) = UA\ice{x)) and Pr(MBob(a;) 7 ^ WAiice(a;)) are the probabilities that Bob’s answer is correct and 
incorrect for a given x, respectively, and r denotes the index of the possible sets of Alice’s strategies 

WAiice(a;)- For our later analysis, we here rewrite CBob.x) for each r, as 

?Bob.i = ^ (Pr(uBob = 0|a; = 0) + Pr(nBob = 0|a; = 1) - Pr(nBob = l|a; = 0) - Pr(nBob = l|a: = 1)), 

?Bob ,2 = ^ (Pr(uBob = 0|a; = 0) + Pr(nBob = Mx = 1) - Pr(nBob = Mx = 0) - Pr(nBob = 0|a; = 1)), 

?Bob ,3 = ^ (Pr(rtBob = l|a: = 0) + Pr(MBob = 0|x = 1) - Pr(MBob = 0\x = 0) - Pr(MBob = Ija:; = 1)), 

?Bob ,4 = ^ (Pr(rtBob = Mx = 0) + Pr(MBob = ^x = 1) - Pr(MBob = 0|x = 0) - Pr(MBob = 0|a:; = 1)), (11) 
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FIG. 5: (Color online) Bob’s average score (density-plot 
on the left, and 3D-plot on the right) with respect to |ao| and 
|ai |. Here we assume that Bob’s brain performs the reasoning 
with the well-quantified probabilities of his preferences. 



FIG. 6: (Color online) Bob’s average score (density-plot 
on the left, and 3D-plot on the right). The probabilities as in 
Eq. © are also assumed to be well-quantified, and Bob can 
chose appropriate phase factors (j)j (j = 0,1) following the 
rules in Eq. 


where Pr(?XBob|^) = 0,1) is the probability that Bob 
identifies UBoh{x), given the memory number x. In the 
following, we shall analyze the total average score ^Bob 
achievable from each of the two reasonings. 

1) Average seore aehievable from the elassieal proba- 
bilistie reasoning. - First, we write out explicitly the con¬ 
ditional probabilities Pr(i4Bob|^) {x = 0,1) in Eq. ([TT]) : 


Pr(rtBob = 0|x = 0) 
Pr(i4Bob = l\x = 0) 
Pr(i4Bob = 0|x = 1) 
Pr('UBob = l|x = 1) 


1 

1 

2 -«o, 

1 

- + OL^ai^ 

1 

- — a^ai. 


( 12 ) 


Here, from Eqs. ©-dm), we can derive that, if there is 
no biased value of the factor ol (i.e., q^o = = 0), and 

hence Bob’s preferences, then Bob’s total average score 
^Bob will be 0. In such a case. Bob would become in¬ 
different (i.e., ‘^’) to the choice of his strategies. How¬ 
ever, if Bob can have a finite non-zero value of ol for the 
given hints, the winning average can be improved. For 
example, when Alice’s strategies UA\[ce{x) are [r.l] and 
the probabilities of Bob’s preferences are well-quantified 
with the directional condition {oq > 0, q^i > 0) (as de¬ 
picted in Fig. [1]), Bob’s average score can be increased 
up to ao + 2(aoQ^i >0. By generalizing this advantage 
for other cases, it is found that Bob can have 


4oi> = l«o|+2K||ai|>0, (13) 

where the superscript ‘(C)’ means that the score is 
achievable from the classical probabilistic reasoning. In 
Fig. [3 the graphs of are given with respect to 

10^01 and |(ai|, assuming that the probabilities of Bob’s 
preferences are well-quantified for the given hints. How¬ 
ever, if Bob uses ill-quantified probabilities. Bob could 
have < 0, decreasing his winning average (see ap- 

pendix[3for details about this). This situation may arise 
when the hints are made with any malicious intention. 


2) Average seores aehievable from the quantum reason¬ 
ing. - To analyze Bob’s score achievable from the quan¬ 
tum reasoning, we also evaluate the conditional proba¬ 
bilities Pr(i4Bob|^) {x = 0,1) as 


Pr(i^Bob = 0|x = 0) = 
Pr('UBob = l|a:^ = 0) = 
Pr('UBob = 0|x = 1) = 
Pr('UBob = l\x = l) = 
where F is defined as 


f+ao, 

1 

--^0, 

- + a^ai H- F cos(7rA), 
i — aoOi — Fcos(7rA), (14) 


r = Q-KI')- ( 15 ) 

Here we readily see that the additional term “F cos(7rA)” 
appears in the case of x = 1. We note, again, that the 
factor A comes from the quantum phases (j)j {j = 0,1) 
involved in the unitary gates Rj {j = 0,1). Thus, by 
applying the rules of Eq. m, Bob can get (as long as 
0^0 7^ 0 and ai 7^ 0) 


'^(Q) _I ■p 

^Bob ~ ^Bob w J- , 


(16) 


where the superscript ‘(Q)’ denotes the score obtained 
by quantum reasoning. Here, by observing Eq. ([T6]^ , we 

can directly see that is always larger than or equal 
(c) 

to which means that Bob can increase his winning 

average more than in the classical probabilistic reason¬ 
ing, notably even in the ease where the elements of the 
game are all elassieal. The equality is satisfied when the 
given hints are perfect; namely, when |(ao| = \cli\ = 
This is quite natural, because if the given hints contain 
whole information of UA\ice{x), then Bob can have strong 
preferenees (i.e., V’) toward his 100% winning [45|. In 
Fig. [6l we give the graphs of for the well-quantified 

probabilities. However, we should point out that Bob’s 
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FIG. 7: (Color online) The simulation data of (blue 

circle) and (red circle) are plotted for the two reasonings: 
(left) Classical probabilistic, and (right) quantum. The solid 
lines are the theoretical values drawn by Eq. (fT3)) and Eq. ([T6l) . 
The data are very well matched to the theoretical lines. 


winning average can also be decreased due to any mali¬ 
cious hinting. In particular, could be much smaller 
than in the worst case (see appendix [X|) . 


1 


CD 

8 0.75 

CO 

CD 

D) 


CO 

CO 

-S 0.25 


0 

0 0.1 0.2 0.3 0.4 0.5 

|a| 

EIG. 8: (Color online) We present the data of (blue 

squre) and Sg^^ (red circle), by assuming that \a\ = |oo| = 
|oi|. The quantified probabilities in Eq. © are assumed to 
be good. Each data point is also averaged over 10^ trials of 
the game. Here we can see that the data are also very well 
matched to the theoretical (blue and red dashed) lines, drawn 
by Eq. (0 and Eq. m- 
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V. NUMERICAL SIMULATIONS 

We now demonstrate the results of our theoretical 
analysis through numerical simulations that are designed 
based on Fig. [3] and Fig. S) Firstly, we assume that 
Bob enjoys a finite number of games N following each 
of the two reasoning processes. Here, Alice chooses her 
secret-bits randomly in each game and Bob’s brain al¬ 
ways uses well-quantified probabilities (i.e., the certain 
values of |(ao| and \ai\ with appropriately assigned direc- 
tional conditions). In Fig. [71 we plot the data of 

and obtained from (left) classical probabilistic and 
(right) quantum reasoning. The data are plotted in the 
space of 10^01 and \ai\ (from 0.05 up to 0.45 at 0.05 in¬ 
tervals). Each point is made by averaging the scores 
over A = 10^ games. Note that the graphs are dupli¬ 
cating Fig. [5] and Fig. |6l Actually, the data are very well 
matched to the theoretical values (the solid lines) drawn 
by Eq. (US)) and m- 

We then plot the data of B^J^ (blue square) and 
(red circle) with respect to \a\ in Fig. |8l Here, we let 
\a\ = 10^01 = |<ai|. Each data point is also averaged 
over A = 10^ games, and the quantified probabilities 
in Eq. (|3|) are assumed to be good. The dashed (blue 
and red) lines denote the theoretical values, drawn by 
Eq. ([T3|) and Eq. ([T6|) . In this case, it is directly seen that 
the increments of Bob’s average scores from the quantum 
reasoning are higher than those from the classical prob¬ 
abilistic reasoning. Notably, degree of the increment is 
conspicuous when the amount of the Bob’s preferences 
are very weak (as long as \a\ 7^ 0). Actually, Bob can in¬ 
crease his average scores more than 0.5 when \a\ = 0.05 
from the quantum reasoning, whereas the increments al¬ 
lowed from the classical probabilistic reasoning are van¬ 
ishingly small. Note that B^J^ = = 0 when \a\ = 0. 


VI. SUMMARY AND DISCUSSION 

In summarizing, we have presented a classical two- 
player (Alice and Bob) game, called the Secret-Bit Guess¬ 
ing Game, where Bob attempts to guess what Alice’s bits 
are. Using this game, we designed a legitimate process of 
Bob’s reasoning using a simple Boolean function and de¬ 
fined one-player (Bob’s) reasoning problem in the context 
of the theory of decision-making. We then considered two 
parallel ways of Bob’s reasoning: One is classical proba¬ 
bilistic, and the other is quantum. We primarily inves¬ 
tigated whether or not Bob can get the quantum advan¬ 
tage, particularly without changing the classical setting 
of the game. We replaced each reasoning which may oc¬ 
cur in Bob’s brain to a machinery process with the corre¬ 
sponding internal devices. On the basis of the analysis of 
payoff function, we explicitly showed that Bob can make 
better use of his weak preferences with quantum reason¬ 
ing, faithfully dealing with quantum superposition. This 
quantum advantage was possible because the main log¬ 
ical operations present in Bob’s brain provided another 
degree of freedom due to the quantum phase, and this en¬ 
abled the rational player. Bob, to explore an additional 
way of using his weak preferences to maximize his chance 
of winning. The important scientific message of our study 
is: It appears to he possible to get a quantum advantage 
even in the ease where all strategies are elassieal. We ad¬ 
ditionally investigated (in appendix O that if the hints 
are made to deceive Bob, then Bob’s winning average 
can decrease in general. However, in the worst case, such 
a disadvantage becomes much more acute. Thus, the 
quantum advantage in our game was counter-balanced 
with malicious hinting, and allowed us to remark on how 
to maximize the potential quantum advantages in such a 
system (see also Ref. [22|). 
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As a response, one may consider that Bob’s brain can 
probabilistically simulate the single-qubit process with 
the classical stuffs, and duplicate the measurement out¬ 
comes which accurately compare with those from the 
quantum reasoning [^. However , th is does not mean 
“there is nothing the quantum” [^. In fact, to ar¬ 
gue that “a single-qubit cannot be viewed as a genuine 
quantum system” as “it can classically be simulable” is a 
long-standing problem, and for several years studies have 
shown that the single-qubit is incompatible with classical 
models in terms of temporal inequalities (Hi! , no-go 
theorems [26|, [l^l, operational auasi-Drobabilitv |28|. etc. 

We believe that our work can provide some intuition 
on how can we get a quantum advantage using classical 
information or classical data. This question is of partic¬ 
ular significance, since it may be related to some recent 
issues, e.g., in the field of quantum machine learning al¬ 
gorithm (see Ref. [29[ for more details). Our work is also 
expected to open up follow up studies across multiple 
disciplines, such as quantum cryptography and artificial 
intelligence. 



_ ((j) 

FIG. 9: (Color online) We depict Bob’s score ^ averaged 
for a specific set of UA\ie{x) {x = 0,1): (top-left) [r.l] (top- 
right) [t.2], (bottom-left) [t.3], and (bottom-right) [t.4]. We 
specify the regions of ‘good’ (red dashed box) and ‘bad’ (green 
dashed box) probabilities (see, also, Fig. [1]). 
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Appendix A: Further analyses: ‘well-quantified’ and 
‘ill-quantified’ probabilities of Bob’s preferences 


Here we give further analyses for well-quantified and 
ill-quantified probabilities of Bob’s preferences. Eirstly, 
we recall the total average score of Bob achievable 
from the classical probabilistic reasoning. Erom Eq. (pT]) 

and Eq. (USD, we write CB^b,r 2, 3,4) as below. 


v(C') 

SBob,l 


Q^o 2cro^i 5 


ao — 2aoai, 
—ao + 2aoai, 
—(Xq — 2(Xo(Xi. 


(Al) 


appropriately assigned directional conditions of a (see 
Fig.HJ . More specifically. Bob can have 

“£b,best = l“i| + 2 |ai| \a2\ > 0, (A2) 

as described in the main text [see Eq. m]- However, if 

the hinting is malicious. Bob may fail. To see this clearly, 

—(c) 

we draw the graphs of ^ for the cases r = 1, 2, 3,4 
(see Eig. [9]). In each graph, we specify the regions of 
well-quantified (red dashed box) and ill-quantified (green 
dashed box) probabilities in the space of (o^o, <xi). Here 
we can imagine any malicious hinting that misleads Bob 
toward the green dashed regions. This is the worst sce¬ 
nario for Bob, in which he will have the score 

'^Bob,worst ~ ~ l^ll ~ 2 |ctl| |ct 2 | • (A3) 

If Bob has to take into account all these situations, it is 
evident that Bob will have Esob = 0? because he cannot 
have any preferences for the given hints. 

Then, turning our analysis to the quantum reasoning, 
let us write ^g^^ by using Eq. (pTj) and Eq. (fT4|) . as 

-(Q) _ -(C) 

^Bob,l — ^Bob,l + TCOS (ttA), 

-(Q) _ -(C) 

SBob,2 — SBob,2 ~ TCOS (ttA), 

^Bob,3 = CBob ,3 + TCOS (ttA), 

CboM = CBob,4“rcos(7rA), (A4) 


It is clear that if there is no bias among the preferences, 
—(C) 

i.e., cto = cti = 0, then ^Bob r = ^ ^ = 1,2, 3,4. 

However, if Bob has a = (ao, <ai)^ 7 ^ (0,0)^ for the given 
hints. Bob can improve his winning average with the 


where E = 2 ^(i - jaip) (i - |q! 2 |^) [see Eq. (US])]. 

Here, it is also true that Bob cannot improve his win¬ 
ning average when ag = = 0. In such a case. Bob 
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FIG. 10: (Color online) We give the graphs of for (top- 

left) [r.l], (top-right) [t.2], (bottom-left) [t.3], and (bottom- 
right) [t.4]. We also specify the regions of ‘good’ (red dashed 
box) and ‘bad’ (green dashed box) probabilities. 


has CBob,r = 0 with cos (ttA) = 0 for all r = 1,2, 3,4. 


However, if he can use well-quantified probabilities in his 
quantum reasoning, the average score can be higher than 
Eq. (jA2[) . Specifically, Bob can have 

^Bob,best ~ ^Bob,best ^ J- , 

as described in the main text [see Eq. m]- However, 
there can also be malicious hinting, in which case Bob 
may fail, similarly to the classical case. Let us see 

the graphs of JboL in Fig- UHl where the regions of 
well-quantified (red dashed box) and ill-quantified (green 
dashed box) probabilities are also specified. Erom the 
same analysis as in the case of the classical probabilistic 
reasoning, we can see that Bob’s total average scores can 
be decreased. Notably, in the worst case, such disadvan¬ 
tages can be maximized as 


^(Q) _ ^(C') _ p / A 

^Bob,worst '^Bob,worst 

This implies that the quantum reasoning can make the 
situation worse. In the case where Bob cannot evaluate 
whether the given hints are good or not, it is not possible 
for Bob to improve his winnings, as described above. 
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